Abstract

We apply a PDE-based method to deduce the critical time and the size of
the giant component of the "triangle percolation" on the Erdős-Rényi random
graph process investigated by Palla, Derényi and Vicsek in [4], [9].



1 Introduction 

In this Note we investigate triangle percolation in the Erdős-Rényi random graph
and other related graphs. The model was defined in [4] and [9] by Derényi, Palla,
and Vicsek in order to simulate the phase transition of overlapping communities in
real networks. They also considered the more general case of $k$-clique percolation,
but we restrict our attention to the $k = 3$ case.

The Erdős-Rényi random graph $G(N,p)$, defined in [5], is a random subgraph
of $K_N$, the complete graph on $N$ vertices. The edge set of $G(N,p)$ is chosen at
random in the following manner: we declare every edge occupied with probability
$p$, otherwise we call the edge vacant. If $N$ is very large and $p = t/N$, it is well
known that the random graph undergoes a phase transition: if we consider the size
distribution of the connected components, then a giant component emerges at
$t_c = 1$ (this is called the critical time in the $k = 2$ case). Because of the mean
field property of the graph, it is possible to relate the distribution of the size of
the connected component of an arbitrary vertex to the total population of a
Galton-Watson branching process whose offspring distribution is Poisson with
parameter $\lambda = t$ (see e.g. lEl).

The branching process (and, in the $N \to \infty$ limit, the random graph) becomes
supercritical when $t$, the expected number of first generation offspring (i.e. the
number of neighbors of the root), exceeds 1. The limiting component-size
distribution of the random graph and the density of the giant component can be
determined explicitly with the generating function method.
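
As a minimal sketch of the generating function computation in the $k = 2$ case:
the density $\theta(t)$ of the giant component equals the survival probability of
the Poisson($t$) Galton-Watson process, i.e. the largest solution of
$\theta = 1 - e^{-t\theta}$ in $[0,1]$. The function name `giant_density` and the
choice of a plain fixed-point iteration are illustrative assumptions, not taken
from the cited references.

```python
import math

def giant_density(t, iterations=2000):
    """Largest root of theta = 1 - exp(-t * theta) in [0, 1]."""
    theta = 1.0  # start at the top so the iterates decrease to the largest root
    for _ in range(iterations):
        theta = 1.0 - math.exp(-t * theta)
    return theta

# Below the critical time t_c = 1 the density vanishes, above it is positive.
subcritical = giant_density(0.5)
supercritical = giant_density(2.0)
```

The iteration converges slowly near $t = 1$, so close to the critical time a
root finder with a convergence check would be a safer choice.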

Vicsek et al. generalize this idea to determine the critical time of a different
type of phase transition: if we consider the triangle subgraphs of the Erdős-Rényi
random graph and declare two triangles connected if they have a common edge,
then a different time-scale is needed to see the emergence of a connected
triangulated giant component: the branching process method makes sense in the
$N \to \infty$ limit if $p = t/\sqrt{N}$, and $t_c = 1/\sqrt{2}$ is the critical time of the
$k = 3$ case, see [9].

A different approach to arrive at the same results (in the $k = 2$ case) is to view
the evolution of the random graph as a stochastic process: if every edge of the graph
turns from empty to occupied state with rate $1/N$, independently of one another,
then at time $t$ we will see $G(N, p = 1 - e^{-t/N})$, which is asymptotically the same
as $G(N, p = t/N)$ as $N \to \infty$. It is easy to relate the evolution of the random
graph to the mean field stochastic model of coagulation, the Marcus-Lushnikov
process, which converges to the solution of the Smoluchowski coagulation equation
(with multiplicative kernel), see (in historical order) Q, [3], |[T1, O. A useful way
to handle the Smoluchowski equation is by taking its Laplace transform (which is
essentially the same as using generating functions): the transformed differential
equation becomes a well-known PDE, the Burgers equation (see O, [Sl, HI), which can
be solved explicitly by using the method of characteristics.
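
The claim that $G(N, 1 - e^{-t/N})$ and $G(N, t/N)$ are asymptotically the same can
be illustrated numerically: the ratio of the two edge probabilities tends to 1 as
$N$ grows. This is only a sketch; the function name `occupation_probability` is a
hypothetical helper introduced here.

```python
import math

def occupation_probability(t, n):
    """P(edge occupied at time t) when each edge turns occupied at rate 1/n."""
    return -math.expm1(-t / n)  # 1 - exp(-t/n), computed without cancellation

# Ratio of the exact edge probability to the approximation t/n for growing n.
ratios = [occupation_probability(2.0, n) / (2.0 / n) for n in (10, 1000, 100000)]
```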

The method of branching processes cannot be applied to a slightly modified
random graph process, where the dynamics are the same but the initial graph is
an arbitrary graph (and not the empty graph as in the Erdős-Rényi case), but the
PDE method generalizes to these models both in the $k = 2$ and the $k = 3$ case. In
this Note we derive a (non-linear, non time-homogeneous, first order) PDE which
describes the evolution of the $N \to \infty$ limit of the triangulated component-size
densities of a general mean-field random graph, and give an explicit solution to
that PDE, which enables us to calculate the critical time and the size of the giant
triangulated component for an arbitrary initial component size-distribution.

2 Definitions 

In the rest of this Note, we will look at the random graph processes on the following
time-scale: $G(N, p \approx t/\sqrt{N})$, or more briefly $G(N,t)$: vacant edges turn
occupied independently with rate $1/\sqrt{N}$. The edge set of $G(N,t)$ is denoted by
$E(N,t)$. In the Erdős-Rényi case, the initial graph has no edges, but it can itself
be a random graph in the general case.

In order to describe the triangle-structure of the graph, let us define an auxiliary
graph $\hat{G}(N,t)$ with vertex set $E(N,t)$, in which two vertices of $\hat{G}(N,t)$ are
connected by an edge if the corresponding $e, f \in E(N,t)$ are edges of the same
triangle whose third edge is also in $E(N,t)$. We call the connected components of
$\hat{G}$ the triangulated components of $G$. The "size" or "weight" of a triangulated
component is its size in $\hat{G}$, that is, the number of edges of $G$ it contains.
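
The definition can be made concrete with a small brute-force sketch that computes
the sizes of the triangulated components of a finite simple graph. It runs a
union-find over the edges and merges the three edges of every triangle. The function
name, the input format (vertex count plus a list of distinct edges) and the cubic
triangle search are assumptions made for illustration only.

```python
from itertools import combinations

def triangulated_component_sizes(n_vertices, edges):
    """Sorted sizes (numbers of edges) of the triangulated components."""
    edge_list = [tuple(sorted(e)) for e in edges]      # assumed distinct
    index = {e: k for k, e in enumerate(edge_list)}
    parent = list(range(len(edge_list)))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]              # path halving
            a = parent[a]
        return a

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb

    # Every occupied triangle glues its three edges into one component.
    for u, v, w in combinations(range(n_vertices), 3):
        tri = [(u, v), (u, w), (v, w)]
        if all(e in index for e in tri):
            union(index[tri[0]], index[tri[1]])
            union(index[tri[0]], index[tri[2]])

    sizes = {}
    for k in range(len(edge_list)):
        root = find(k)
        sizes[root] = sizes.get(root, 0) + 1
    return sorted(sizes.values())

# Two triangles sharing the edge (1, 2), plus one edge that lies in no triangle.
shared_edge = triangulated_component_sizes(
    5, [(0, 1), (1, 2), (0, 2), (1, 3), (2, 3), (3, 4)])
# Two triangles sharing only the vertex 2 stay in separate components.
shared_vertex = triangulated_component_sizes(
    5, [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4), (2, 4)])
```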

If $e \in E(N,t)$, denote by $S(e,t)$ the size of the triangulated component of $e$.
Let us denote by $C_n(N,t)$ the number of triangulated components of weight $n$.
Summing the total weight of components we get the total number of edges:
$\sum_{n \in \mathbb{N}} n \cdot C_n(N,t) = |E(N,t)|$, and we can define

\[ m_1^*(t) := \lim_{N \to \infty} \frac{|E(N,t)|}{\frac{1}{2} N^{3/2}} = m_1^*(0) + t \tag{2.1} \]

by the law of large numbers. 

If we define $c_n(N,t) = C_n(N,t) / (\frac{1}{2} N^{3/2})$, then it is natural to expect
that $\lim_{N \to \infty} c_n(N,t) = c_n(t)$ exists and is a deterministic nonnegative
real number for each $n$ and $t$. Of course, there is a minimal criterion for this to
hold: the sequence of initial random graphs must have the property that the limits
$\lim_{N \to \infty} c_n(N,0) = c_n(0)$ exist. We need additional assumptions:

The asymptotic mean field independence property, or briefly A.M.F.I.P.: Let
us choose a subset $E' \subset E(N,t)$ with $|E'| \ll N$ (for example $E'$ can be the set
of edges connected to $v_0 \in V(G)$), explore the connected components of the edges
of $E'$ in $\hat{G}(N,t)$ and denote the edges contained in the explored triangles by
$\bar{E}'$. The A.M.F.I.P. is satisfied if the probability distribution of $S(f,t)$, where
$f \notin \bar{E}'$, is asymptotically independent of the distribution of the explored
components as $N \to \infty$. Note that this property is weaker than the "branching
process" property: there, asymptotic independence holds even after a smaller
exploration step: if $e \in \bar{E}'$, then the number of triangles that contain $e$, but
are not contained in $\bar{E}'$, is asymptotically independent of $\bar{E}'$.

We also assume that our sequence of mean field initial graphs has the asymptotic
trivial structure property, or briefly A.T.S.P.: let us define another auxiliary
graph, $\tilde{G}(N,t)$. $V(\tilde{G}(N,t))$ consists of the triangles of $G(N,t)$ and two
vertices are connected if the corresponding triangles share an edge. The A.T.S.P.
means that asymptotically almost surely the connected component of an arbitrarily
chosen vertex of $\tilde{G}(N,t)$ is either the (unique) giant component of $\tilde{G}(N,t)$
or a tree.

An immediate consequence of the A.T.S.P. assumption is that $c_n(t) > 0$ only
if $n$ is in $O$, the set of positive odd numbers: if we remove a leaf vertex from a
tree in $\tilde{G}(N,t)$, then we remove exactly two edges of $E(N,t)$, so a tree of $k$
triangles contains $2k + 1$ edges. Denote by $O^\infty = O \cup \{\infty\}$: in the mean
field limit, the size of the triangulated component of an edge $e$ is in $O^\infty$,
and $S(e,t) = \infty$ means that $e$ is an edge of the triangulated giant component.

Let us define the Laplace transform (generating function)
$C(t,x) = \sum_{n \in O} c_n(t) \cdot e^{-n x}$ for $t \ge 0$ and $x > 0$. Equivalently,
$C(t,x) = \sum_{n \in O^\infty} c_n(t) \cdot e^{-n x}$, since $e^{-\infty \cdot x} = 0$. Denote
the partial derivatives of $C$ with respect to $t$ and $x$ by $\dot{C}$ and $C'$,
respectively. It is convenient to define $v_n(t) = n \cdot c_n(t)$ and
$V(t,x) = \sum_{n \in O} v_n(t) \cdot e^{-n x}$, so that $C'(t,x) = -V(t,x)$ holds. Under the
assumption of the mean field properties, the function $C$ will satisfy the
following PDE:

\[ \dot{C}(t,x) = e^{V(t,x)^2 - m_1^*(t)^2 - x} - 2 V(t,x) \cdot m_1^*(t) \tag{2.2} \]

After solving the PDE, we will be able to express $c_n(t)$ as a function of the
initial data. Let us emphasize that $m_1(t) := \sum_{n \in O} v_n(t)$ need not equal
$m_1^*(t)$ for all $t$ (although we do assume that $m_1(0) = m_1^*(0)$), because the
$N \to \infty$ limit and the $\sum_{n \in O}$ summation are not interchangeable in the
supercritical phase: the breakdown of equality indicates the presence of a giant
component, because a positive portion of edges is missing if we sum the weight of
small components:

\[ m_1(t) + v_\infty(t) = m_1^*(t) = m_1(0) + t \tag{2.3} \]

where $v_\infty(t) \cdot \frac{1}{2} N^{3/2}$ is, up to leading order, the weight of the giant
component of $G(N,t)$.



3 Derivation of the PDE 

In order to derive (2.2), we need some more definitions.

Let us orient the edges of $K_N$ in an arbitrary way, so that we can talk about the
"initial" and "final" endpoints of each edge. If $e$ is the new edge that we are about
to occupy at time $t$, with initial and final endpoints $u$ and $v$, then the "vicinity"
of $e$ can be described by a two-variable function
$\rho_e^t : O^\infty \times O^\infty \to \mathbb{N}$, where $\rho_e^t(i,j)$ is the number of vertices $w$
such that $w$, $u$ and $v$ form an "$(i,j)$-type cherry": both $\{u,w\}$ and $\{v,w\}$ are in
$E(N,t)$, moreover $S(\{u,w\},t) = i$ and $S(\{v,w\},t) = j$. The choice $i = \infty$ or
$j = \infty$ is admissible, because we want to take into account those edges in the
vicinity of $e$ that belong to the giant triangulated component. When $e$ becomes
occupied, all the components in the vicinity of $e$ merge into one component of size
$1 + \sum_{i,j} \rho_e^t(i,j) \cdot (i + j)$, since the merged non-giant components are distinct
by the A.T.S.P. The value of $C_n(t)$ changes by

\[ I\Big[ n = 1 + \sum_{i,j} \rho_e^t(i,j) \cdot (i + j) \Big] - \sum_i \rho_e^t(i,n) - \sum_j \rho_e^t(n,j) \]
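
The bookkeeping in this formula can be sketched in a few lines. The function below
is a hypothetical helper that takes the cherry counts as a dictionary mapping
$(i,j)$ to $\rho_e^t(i,j)$ and returns the change of every $C_n$. It assumes that only
finite sizes occur, so cherries touching the giant component are left out of this
sketch.

```python
from collections import Counter

def component_count_change(rho):
    """Change of the counts C_n when an edge with vicinity rho becomes occupied."""
    delta = Counter()
    merged = 1 + sum(count * (i + j) for (i, j), count in rho.items())
    delta[merged] += 1            # indicator term: the newly formed component
    for (i, j), count in rho.items():
        delta[i] -= count         # components lost through the first leg
        delta[j] -= count         # components lost through the second leg
    return dict(delta)

# One (1,3)-type cherry and two (1,1)-type cherries around the new edge.
change = component_count_change({(1, 3): 1, (1, 1): 2})
# Total weight grows by exactly one, the newly occupied edge.
weight_gain = sum(n * d for n, d in change.items())
```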

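We can give the probability distribution of $\rho_e^t$ for an arbitrary $e$ when
$N \to \infty$ using the A.M.F.I.P. For each $w$,

\[ P(S(\{u,w\},t) = i) \approx \frac{i \cdot C_i(N,t)}{\frac{1}{2} N^2} \approx \frac{v_i(t)}{\sqrt{N}} \]

up to leading order. Also $P(S(\{u,w\},t) = \infty) \approx v_\infty(t)/\sqrt{N}$. Using similar
estimates and the A.M.F.I.P., the probability that $w$, $u$ and $v$ form an $(i,j)$-type
cherry is $v_i(t) v_j(t)/N$, up to leading order. When $N \to \infty$, the number of
$(i,j)$-type cherries in the vicinity of $e$ has Poisson distribution and their joint
distribution is the product measure: for any fixed
$r : O^\infty \times O^\infty \to \mathbb{N}$, the probability of the event
$\{\forall i \, \forall j \;\; \rho_e^t(i,j) = r(i,j)\}$ (or briefly $\{\rho_e^t = r\}$) is

\[ P(\rho_e^t = r) = \prod_{i,j \in O^\infty} e^{-v_i(t) v_j(t)} \frac{(v_i(t) v_j(t))^{r(i,j)}}{r(i,j)!} \]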

We can now start to derive the differential equation (2.2).

Between $t$ and $t + dt$, approximately $\frac{1}{2} N^{3/2} \, dt$ edges become occupied, and
their contributions to the change of $C_n(N,t)$ are independent, again by the
A.M.F.I.P., so we may use the law of large numbers to describe the evolution of the
component-size vector:

\[
\begin{aligned}
C_n(N, t + dt) - C_n(N, t) &\approx E\big( C_n(N, t + dt) - C_n(N, t) \big) \\
&\approx \sum_r \Big( I\Big[ 1 + \sum_{i,j} r(i,j) \cdot (i + j) = n \Big] - \sum_i r(i,n) - \sum_j r(n,j) \Big) \cdot P(\rho_e^t = r) \cdot \tfrac{1}{2} N^{3/2} \, dt
\end{aligned}
\]

If we divide both sides by $\frac{1}{2} N^{3/2} \, dt$, let $N \to \infty$ and $dt \to 0$ and take
the Laplace transform of both sides, then the left-hand side becomes $\dot{C}(t,x)$.
Let us calculate the Laplace transform of the right-hand side. The first term is



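\[
\begin{aligned}
\sum_{n \in O^\infty} e^{-n x} \sum_r I\Big[ 1 + \sum_{i,j} r(i,j) \cdot (i + j) = n \Big] \cdot P(\rho_e^t = r)
&= \sum_r e^{-x (1 + \sum_{i,j} r(i,j) (i + j))} \prod_{i,j} e^{-v_i(t) v_j(t)} \frac{(v_i(t) v_j(t))^{r(i,j)}}{r(i,j)!} \\
&= e^{-x} \prod_{i,j} \sum_{r=0}^{\infty} e^{-v_i(t) v_j(t)} \frac{(v_i(t) v_j(t) e^{-(i+j) x})^r}{r!} \\
&= e^{-x} \prod_{i,j} \exp\big( v_i(t) v_j(t) (e^{-(i+j) x} - 1) \big)
 = e^{V(t,x)^2 - m_1^*(t)^2 - x}
\end{aligned}
\]

Here the products run over $i, j \in O^\infty$, with the convention $e^{-\infty \cdot x} = 0$.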
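
The factorization used in this computation can be checked numerically. The sketch
below evaluates each Poisson factor by a truncated series and compares the product
with the closed form $e^{V^2 - (m_1^*)^2 - x}$. All names are hypothetical; the
dictionary `v` holds finitely many values $v_n$, and `v_inf` stands for $v_\infty$.
Cherries touching the giant component contribute only through the event that there
are none of them, because $e^{-\infty \cdot x} = 0$.

```python
import math

def poisson_laplace(mean, weight, x, terms=80):
    """E[exp(-x * weight * R)] for R ~ Poisson(mean), by a truncated series."""
    total, term = 0.0, math.exp(-mean)      # term = P(R = 0)
    for r in range(terms):
        total += term * math.exp(-x * weight * r)
        term *= mean / (r + 1)              # P(R = r + 1) from P(R = r)
    return total

def first_term(v, v_inf, x):
    """Laplace transform of 1 + sum r(i,j)(i+j), multiplied factor by factor."""
    product = math.exp(-x)
    for i in v:
        for j in v:
            product *= poisson_laplace(v[i] * v[j], i + j, x)
    m_fin = sum(v.values())
    # Probability of seeing no cherry with a leg in the giant component.
    product *= math.exp(-(2.0 * m_fin * v_inf + v_inf ** 2))
    return product

def closed_form(v, v_inf, x):
    """exp(V^2 - (m_1^*)^2 - x) with V = sum v_n e^{-n x}, m_1^* = sum v_n + v_inf."""
    big_v = sum(v[n] * math.exp(-n * x) for n in v)
    m_star = sum(v.values()) + v_inf
    return math.exp(big_v ** 2 - m_star ** 2 - x)

example_v = {1: 0.3, 3: 0.2}
lhs = first_term(example_v, 0.1, 0.5)
rhs = closed_form(example_v, 0.1, 0.5)
```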



The second term: 



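\[
\begin{aligned}
\sum_{n \in O^\infty} e^{-n x} \sum_r \sum_{i \in O^\infty} r(i,n) \, P(\rho_e^t = r)
&= \sum_{n \in O} e^{-n x} \sum_{i \in O^\infty} E\big( \rho_e^t(i,n) \big) \\
&= \sum_{n \in O} e^{-n x} \sum_{i \in O^\infty} v_i(t) v_n(t) = m_1^*(t) V(t,x)
\end{aligned}
\]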

The third term is handled in the same way. 

Putting these equations together we arrive at (2.2). It is convenient to use the
shorthand notation $W(t,x) := e^{V(t,x)^2 - m_1^*(t)^2 - x}$ for the Laplace transform
(generating function) of the size of the component we get by occupying a vacant edge
at time $t$. Note that $W(t,0+) < 1$ indicates that this probability distribution is
defective in the supercritical case, since $m_1^*(t) - V(t,0+) = v_\infty(t) > 0$.



4 Solution of the PDE 

It is possible to give an explicit solution to (2.2) with the method of
characteristics. Differentiating the PDE with respect to $x$ and rearranging the
equation we get a first order PDE for $V$:

\[ \dot{V} + V' \cdot \big( e^{V^2 - m_1^*(t)^2 - x} \cdot 2V - 2 m_1^*(t) \big) = e^{V^2 - m_1^*(t)^2 - x} \tag{4.1} \]
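
Let us consider the following ODE with initial condition $x(0) = x$:

\[ \dot{x}(t) = 2 V(t,x(t)) \, W(t,x(t)) - 2 m_1^*(t) \tag{4.2} \]

If we define $v(t) = V(t,x(t))$, then $v(0) = V(0,x)$. Putting (4.1) and (4.2)
together we get a system of differential equations that can be solved without
knowing $V(t,x)$ in advance:

\[ \dot{x}(t) = 2 v(t) \, e^{v(t)^2 - m_1^*(t)^2 - x(t)} - 2 m_1^*(t), \qquad \dot{v}(t) = e^{v(t)^2 - m_1^*(t)^2 - x(t)} \]

In order to solve these equations explicitly, define
$w(t) = e^{v(t)^2 - m_1^*(t)^2 - x(t)} = W(t,x(t))$. Since $\dot{m}_1^*(t) = 1$ by (2.1),

\[ \dot{w}(t) = w(t) \cdot \big( 2 v(t) \dot{v}(t) - 2 m_1^*(t) - \dot{x}(t) \big) = 0 \]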

Thus $w(t)$ is constant: $W(t,x(t)) = W(0,x)$, $v(t)$ is linear:
$v(t) = V(0,x) + t \cdot W(0,x)$, and $x(t)$ is quadratic:

\[ x(t) = x + (V(0,x) + t \cdot W(0,x))^2 - V(0,x)^2 - (m_1(0) + t)^2 + m_1(0)^2 \tag{4.3} \]
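
As a sanity check, one can integrate the characteristic system numerically and
compare the result with the closed form (4.3). The sketch below uses a classical
Runge-Kutta scheme and, purely as an assumed example, the initial data
$V(0,x) = a e^{-x}$ with $m_1(0) = a$ (all initial components have size 1). The
function names and parameter values are illustrative.

```python
import math

def closed_form_x(t, x0, v0, w0, m0):
    """Right-hand side of (4.3) with v0 = V(0,x0), w0 = W(0,x0), m0 = m_1(0)."""
    return x0 + (v0 + t * w0) ** 2 - v0 ** 2 - (m0 + t) ** 2 + m0 ** 2

def integrate_characteristic(t_end, x0, v0, m0, steps=1000):
    """RK4 for x' = 2 v w - 2 (m0 + t), v' = w, w = exp(v^2 - (m0 + t)^2 - x)."""
    def rhs(t, x, v):
        w = math.exp(v * v - (m0 + t) ** 2 - x)
        return 2.0 * v * w - 2.0 * (m0 + t), w

    h = t_end / steps
    t, x, v = 0.0, x0, v0
    for _ in range(steps):
        k1 = rhs(t, x, v)
        k2 = rhs(t + h / 2, x + h / 2 * k1[0], v + h / 2 * k1[1])
        k3 = rhs(t + h / 2, x + h / 2 * k2[0], v + h / 2 * k2[1])
        k4 = rhs(t + h, x + h * k3[0], v + h * k3[1])
        x += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        v += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        t += h
    return x, v

a = 0.2                       # assumed initial data: V(0, x) = a * exp(-x)
x0, t_end = 1.0, 0.3
v0 = a * math.exp(-x0)
w0 = math.exp(v0 ** 2 - a ** 2 - x0)
x_num, v_num = integrate_characteristic(t_end, x0, v0, a)
x_exact = closed_form_x(t_end, x0, v0, w0, a)
v_exact = v0 + t_end * w0     # v(t) is linear along the characteristic
```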

If we start with $x(0) = 0$, then $x(t) \equiv 0$ and $v(t) = m_1(0) + t = m_1^*(t)$, but
we know that $V(t,0+) \ne m_1^*(t)$ if $t > T_g$. This breakdown of analyticity is due to
the intersection of characteristics: another characteristic curve $x(t)$ starting at
$x$ intersects the $x(t) \equiv 0$ curve at the time when (4.3) becomes zero: the
intersection time $t$ solves the following equation:

\[ t^2 \cdot (W(0,x)^2 - 1) + t \cdot (2 V(0,x) W(0,x) - 2 m_1(0)) + x = 0 \]

If we let $x \to 0$ in this equation, the solution will converge to $t = T_g$, the
first time when another characteristic curve hits 0. We have to divide all the
coefficients by $x$, use $m_1(0) = m_1^*(0) = V(0,0) = \sum_{n \in O} v_n(0)$,
$m_2(0) = -V'(0,0) = \sum_{n \in O} n \cdot v_n(0)$ and $W(0,0) = 1$ to get $-2$ times the
following equation as $x \to 0$:

\[ T_g^2 \cdot (2 m_2(0) m_1(0) + 1) + T_g \cdot \big( m_2(0) + m_1(0) \cdot (2 m_2(0) m_1(0) + 1) \big) - \frac{1}{2} = 0 \tag{4.4} \]

As a special case, if we start from the empty graph, then $m_1(0) = 0$ and
$m_2(0) = 0$, thus we get $T_g = 1/\sqrt{2}$, which agrees with the critical time obtained
in [4] and [9] by using the branching process method. If $G(N,0)$ is uniformly chosen
from all graphs with $m_1(0) \cdot \frac{1}{2} N^{3/2}$ edges (where $m_1(0) < 1/\sqrt{2}$), then this
graph is asymptotically the same as the Erdős-Rényi graph at time $t = m_1(0)$, thus
$m_2(0) = \frac{m_1(0)}{1 - 2 m_1(0)^2}$ and $T_g = 1/\sqrt{2} - m_1(0)$. This result can also
be obtained by the branching process method.

An example of a sequence of initial random graphs that have the A.M.F.I.P., but
do not have the branching process property: let $G(N,0)$ be chosen uniformly from
all triangle-free graphs that have $m_1(0) \cdot \frac{1}{2} N^{3/2}$ edges. In this case
$m_2(0) = m_1(0)$, and the $T_g$ of this graph is greater than the $T_g$ of the
Erdős-Rényi graph with the same $m_1(0)$, but smaller than that of the Erdős-Rényi
graph with the same $m_2(0)$. This follows from the fact that the solution of the
equation (4.4) decreases if we increase $m_1(0)$ or $m_2(0)$.
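
The special cases above can be checked with a short numerical sketch that solves the
quadratic equation (4.4) for its positive root. The function names and the sample
value $m_1(0) = 0.3$ are assumptions chosen for illustration.

```python
import math

def gelation_time(m1, m2):
    """Positive root T_g of (4.4) for the moments m1 = m_1(0), m2 = m_2(0)."""
    a = 2.0 * m2 * m1 + 1.0          # coefficient of T_g^2
    b = m2 + m1 * a                  # coefficient of T_g
    return (-b + math.sqrt(b * b + 2.0 * a)) / (2.0 * a)

def er_second_moment(m1):
    """m_2(0) of the ER graph stopped at time m1 < 1/sqrt(2)."""
    return m1 / (1.0 - 2.0 * m1 * m1)

a0 = 0.3
t_empty = gelation_time(0.0, 0.0)                  # empty initial graph
t_er = gelation_time(a0, er_second_moment(a0))     # ER graph with m_1(0) = a0
t_triangle_free = gelation_time(a0, a0)            # triangle-free, m_2(0) = m_1(0)
# ER graph whose second moment equals a0: solve m / (1 - 2 m^2) = a0 for m.
m_same_m2 = (-1.0 + math.sqrt(1.0 + 8.0 * a0 * a0)) / (4.0 * a0)
t_er_same_m2 = gelation_time(m_same_m2, a0)
```

With these inputs the three critical times come out in the order stated in the
text: the triangle-free value lies between the two Erdős-Rényi values.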

In order to express the value of $v_\infty(t)$, let us define $X(t,w)$, the inverse
function of $W(t,x)$ in the $x$ variable, and $\mathcal{V}(t,w) = V(t,X(t,w))$. $X(t,w)$ is
well-defined and is a decreasing function of $w$ on the interval $(0, W(t,0)]$. Since
$w(t)$ remains constant along the characteristics, $\mathcal{V}(t,w) = \mathcal{V}(0,w) + t w$ and

\[ X(t,w) = X(0,w) + (\mathcal{V}(0,w) + t w)^2 - \mathcal{V}(0,w)^2 - (m_1(0) + t)^2 + m_1(0)^2 \]

is expressed explicitly given the initial data. $W(t,0)$ is the smallest $w$ such that
$X(t,w) = 0$, and

\[ v_\infty(t) = m_1^*(t) - m_1(t) = \mathcal{V}(0,1) + t - \mathcal{V}(0,W(t,0)) - t \, W(t,0). \]
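
For the empty initial graph this recipe becomes fully explicit, and the following
sketch evaluates it by bisection. It assumes $V(0,\cdot) = 0$, hence $W(0,x) = e^{-x}$,
$X(0,w) = -\log w$ and $X(t,w) = -\log w + t^2 (w^2 - 1)$, which follow from the
formulas above with $c_n(0) = 0$. The function name and the tolerance are
illustrative choices.

```python
import math

def giant_weight_empty_start(t, tol=1e-14):
    """v_inf(t) for the empty initial graph, where X(t, w) = -log w + t^2 (w^2 - 1)."""
    if 2.0 * t * t <= 1.0:
        return 0.0                       # subcritical: w = 1 is the only root in (0, 1]

    def x_of_w(w):
        return -math.log(w) + t * t * (w * w - 1.0)

    # X(t, .) is positive near 0 and negative at its minimum w = 1 / (t sqrt 2).
    lo, hi = 1e-300, 1.0 / (t * math.sqrt(2.0))
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if x_of_w(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    w = 0.5 * (lo + hi)                  # approximates W(t, 0)
    return t * (1.0 - w)                 # V(0,1) + t - V(0,w) - t w with V(0,.) = 0

below = giant_weight_empty_start(0.5)    # before T_g = 1/sqrt(2)
above = giant_weight_empty_start(1.0)    # after T_g
```

The root $w = W(t,0)$ found here solves $w = e^{t^2 (w^2 - 1)}$, so the weight of the
giant component is zero up to $T_g = 1/\sqrt{2}$ and positive afterwards.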